Unsupervised Cross-Adaptation Approach for Speech Recognition by Combined Language Model and Acoustic Model Adaptation
نویسندگان
چکیده
The aim of this study is to improve speech recognition with a combination of language model (LM) and the acoustic model (AM) adaptation. The proposed adaptation techniques are based on cross-system adaptation or cross-validation (CV) adaptation. The principle is to use complementary information derived from several systems or data sets. Because language information and acoustic information differ completely, the combined approach is expected to be effective. We evaluate the performance of the proposed methods by conducting speech recognition experiments using the Corpus of Spontaneous Japanese (CSJ). Both cross-system adaptation and CV adaptation give better performance than the conventional adaptation method; the crosssystem adaptation method was found to exhibit the best recognition performance.
منابع مشابه
Unsupervised Language Model Adaptation Using Word Classes for Spontaneous Speech Recognition
This paper proposes an unsupervised, batch-type, class-based language model adaptation method for spontaneous speech recognition. The word classes are automatically determined by maximizing the average mutual information between the classes using a training set. A class-based language model is built based on recognition hypotheses obtained using a general word-based language model, and linearly...
متن کاملImprovement of Lecture Speech Recognition by Using Unsupervised Adaptation
The aim of this work is to improve the recognition performance of spontaneous speech. In order to achieve the purpose, the authors of this chapter propose new approaches of unsupervised adaptation for spontaneous speech and evaluate the methods by using diagonal-covariance and full-covariance hidden Markov models. In the adaptation procedure, both methods of language model (LM) adaptation and a...
متن کاملOnline Unsupervised Multilingual Acoustic Model Adaptation for Nonnative Asr
Automatic speech recognition (ASR) is currently one of the main research interests in computer science. Hence, many ASR systems are available in the market. Yet, the performance of speech and language recognition systems is poor on nonnative speech. The challenge for nonnative speech recognition is to maximize the accuracy of a speech recognition system when only a small amount of nonnative dat...
متن کاملUnsupervised Language and Acoustic Model Adaptation for Cross Domain Portability
This work investigates the task of porting a broadcast news recognition system to a conversational speech domain, for which only untranscribed acoustic data are available. An iterative adaptation procedure is proposed that alternatively generates automatic speech transcriptions and performs acoustic and language model adaptation. The procedure was applied on a tourist-information conversational...
متن کاملLive speech recognition in sports games by adaptation of acoustic model and language model
This paper proposes a method to automatically extract keywords from baseball radio speech through LVCSR for highlight scene retrieval. For robust recognition, we employed acoustic and language model adaptation. In acoustic model adaptation, supervised and unsupervised adaptations were carried out using MLLR+MAP. By this two level adaptation, word accuracy was improved by 28%. In language model ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011